Picture for Deva Ramanan

Deva Ramanan

DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion

Add code
May 08, 2025
Viaarxiv icon

Generating Physically Stable and Buildable LEGO Designs from Text

Add code
May 08, 2025
Viaarxiv icon

Towards Understanding Camera Motions in Any Video

Add code
Apr 21, 2025
Viaarxiv icon

AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis

Add code
Apr 17, 2025
Viaarxiv icon

Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization

Add code
Apr 03, 2025
Viaarxiv icon

Accenture-NVS1: A Novel View Synthesis Dataset

Add code
Mar 24, 2025
Viaarxiv icon

Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models

Add code
Feb 10, 2025
Viaarxiv icon

Using Diffusion Priors for Video Amodal Segmentation

Add code
Dec 05, 2024
Viaarxiv icon

Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers

Add code
Nov 28, 2024
Figure 1 for Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers
Figure 2 for Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers
Figure 3 for Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers
Figure 4 for Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers
Viaarxiv icon

LEARNER: Learning Granular Labels from Coarse Labels using Contrastive Learning

Add code
Nov 02, 2024
Figure 1 for LEARNER: Learning Granular Labels from Coarse Labels using Contrastive Learning
Figure 2 for LEARNER: Learning Granular Labels from Coarse Labels using Contrastive Learning
Figure 3 for LEARNER: Learning Granular Labels from Coarse Labels using Contrastive Learning
Figure 4 for LEARNER: Learning Granular Labels from Coarse Labels using Contrastive Learning
Viaarxiv icon